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19 Abstract (250 words) 

20 The fusion peptides (FP) play an essential role in fusion of viral envelope with cellular 

21 membranes. The location and properties of the FPs in the spike (S) glycoproteins of different 

22 coronaviruses (CoV) have not yet been determined. Through amino acid sequence analysis of S 

23 proteins of representative Co Vs, we identified a common region as a possible FP (pFP) that 

24 shares the characteristics of FPs of Class-I viral fusion proteins including high Ala/Gly content, 

25 intermediate hydrophobicity, few charged residues. To test the hypothesis that this region 

26 contains the CoV FP, we systemically mutated every residue in the pFP of Middle East 

27 Respiratory Syndrome betacoronavirus (MERS-CoV), and found that 11 of the 22 residues in the 

28 pFP (from G953 to L964, except for A956) were essential for S protein-mediated cell-cell fusion 

29 and virus entry. The synthetic MERS-CoV pFP core peptide ( 955 IAGVGWTAGL 964 ) induced 

30 extensive fusion of liposome membranes, while mutant peptide failed to induce any lipid mixing. 

31 We also selectively mutated residues in pFPs of two other (3-CoVs, Severe Acute Respiratory 

32 Syndrome Coronavirus (SARS-CoV) and Mouse Hepatitis Virus (MF1V). Although the amino 

33 acid sequences of these two pFPs differed significantly from that of MERS-CoV and each other, 

34 most of the pFP mutants of SARS-CoV and MF1V also failed to mediate membrane fusion, 

35 suggesting that these pFPs are also the functional FPs. Thus, the FPs of 3 different lineages of (3- 

36 CoVs are conserved in location within the S glycoproteins and in their functions, although their 

37 amino acid sequences have diverged significantly during CoV evolution. 

38 Importance (150 words) 

39 Within the Class-I viral fusion proteins of many enveloped viruses, the FP is the critical mediator 

40 of fusion of the viral envelope with host cell membranes leading to virus infection. FPs from 

41 within a virus family, like influenza viruses or human immunodeficiency viruses (HIV), tend to 


D 

o 


of Virology JVI Journal of Virology Accepted MdnUSCH pf Posted OfTlme 


42 share high amino acid sequence identity. In this study, we determined the location and amino 

43 acid sequences of the FPs of S glycoproteins of 3 P-CoVs: MERS-CoV, SARS-CoV, and MHV, 

44 and demonstrated that they were essential for mediating cell-cell fusion and virus entry. 

45 Interestingly, in marked contrast to the FPs of influenza and HIV, the primary amino acid 

46 sequences of the FPs of P-CoVs in 3 different lineages differed significantly. Thus, during 

47 evolution the FPs of P-CoVs have diverged significantly in their primary sequences, while 

48 maintaining the same essential biological functions. Our findings identify a potential new target 

49 for development of drugs against Co Vs. 

50 
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51 Introduction. 

52 Viruses are obligate intracellular parasites, and host cell membranes act as a barrier to 

53 virus entry. Enveloped viruses initiate infection of cells through fusion of the viral and cellular 

54 membranes. CoVs are enveloped and single stranded plus sense RNA viruses that cause a variety 

55 of diseases among many different species (1). Phylogenetically, CoVs are divided into four 

56 genera: alphacoronavirus (a-CoV), betacoronavirus (|3-CoV), gammacoronavirus (y-CoV), and 

57 deltacoronavirus (8-CoV) (2). 

58 CoVs enter cells through the interactions of the viral S proteins with host receptors. 

59 Several cellular proteins have been identified as receptors for their respective CoVs. Specific 

60 examples include human angiotensin converting-enzyme 2 (hACE2) for SARS-CoV and human 

61 CoV NL63 (3, 4), human dipeptidyl peptidase IV (hDPP4) for MERS-CoV (5), bat DPP4 for bat 

62 CoV HKU4 (6), human aminopeptidase N (hAPN) for human CoV 229E (7), mouse 

63 carcinoembryonic antigen-related cell adhesion molecule la (mCEACAMla) for MHV (8). 

64 The CoV S protein is a Class-I viral fusion proteins. On the CoV virions, the 180-200 

65 kDa S proteins are found as trimers. S monomers contain two subunits called SI and S2. SI 

66 contains the receptor binding domain (RBD) and is responsible for receptor recognition and 

67 binding, whereas S2 possesses the membrane fusion machinery (9, 10), including a fusion 

68 peptide (FP), two heptad repeat domains (called the N-terminal and C-terminal heptad repeats, 

69 HR-N and HR-C), the juxtamembrane domain (JMD) and a transmembrane domain (TMD) (Fig 

70 1A). 

71 To mediate membrane fusion, S protein must be activated, which requires both 

72 proteolytic cleavage (priming) and receptor binding with or without pFI change (triggering) (11- 

73 13). Several host priming proteases are important for S protein mediated CoV entry, including 
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74 cathepsin B and L, serine protease TMPRSS2 and 4, trypsin, elastase, HAT, and furin (14-20). S 

75 protein activation leads to a series of conformational changes and insertion of a putative FP into 

76 target membrane, an essential step in membrane fusion and virus infection. Class-I viral fusion 

77 proteins generally contain one FP, located either internally, like the FPs of the glycoprotein (Gp) 

78 of Ebola virus and the envelope protein (Env) of avian sarcoma leukosis virus (ASLV) (21-24), 

79 or immediately down stream of the “priming” site, as seen in the hemagglutinin (HA) of 

80 influenza and the Env protein of HIV (25, 26). Although the primary sequences and lengths of 

81 FPs vary significantly among different Class-I viral fusion proteins, they share several common 

82 features. Most are rich in Ala and/or Gly, have an intermediate level of hydrophobicity with 

83 membrane binding potential, form helical structures in the presence of trifluoroethanol (TFE), 

84 and contain very few charged resides in the middle of their sequences (13, 25, 27). 

85 Although significant efforts have been made to locate the FPs of different CoVs (28-35), 

86 the exact locations and sequences of CoV FPs remains controversial. While Chambers et al 

87 predicted that the CoV FP was likely adjacent to HR-N (ref), Manu et al proposed that the 

88 sequence immediately following a critical and conserved trypsin cleavage site at the arginine of 

89 position 797 (R797) of SARS-CoV S protein, SFIEDLLFNKVTLADAGF, may be the FP of 

90 SARS-CoV S protein (32). In this study, we used bioinformatics to identify a 17-22 amino acids 

91 long region, just upstream of HR-N, in S2 of different CoVs with characteristic features of the 

92 FPs of other Class-I viral fusion proteins. Using mutational, biochemical, and biophysical 

93 analyses of this region of the S proteins of 3 P-CoVs, MERS-CoV, SARS-CoV, and MHV, we 

94 provide data to support this region as the functional FP of CoV S proteins. 

95 Materials and Methods 
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96 Cell culture. HEK-293, 293T, HEK-293 cells stably expressing hACE2 (293/hACE2), HeLa 

97 cells stably expressing hDPP4 (HeLa/hDPP4), and HeLa cells stably expressing mouse 

98 CEACAMla (HeLa/mCEACAMla) were maintained in Dulbecco’s modified Eagle’s medium 

99 (DMEM) (Invitrogen, Carlsbad, CA) supplemented with 10% fetal bovine serum (FBS) and 2% 

100 penicillin-stereptomycin-fungizone (Invitrogen) at 37 °C with 5% CO 2 . 

101 Constructs and mutagenesis. The constructs, pcDNA-SARS-CoV SA19 (36), pcDNA-MERS- 

102 CoV SA16 (37), and pcDNA-MHV S (38) have been described previously. Briefly, DNA 

103 encoding codon-optimized SARS-CoV S protein lacking the last 19aa, or MERS-CoV S protein 

104 lacking last 16aa but with a FLAG tag at the C-terminus, or full length MHV S protein was 

105 cloned between BamH I and Not I sites of pcDNA3.1. All SARS-CoV, MERS-CoV, and MHV S 

106 mutants were derived from the plasmid pcDNA-SARS-CoV SA19, pcDNA-MERS-CoV SA16, 

107 and pcDNA-MHV S, respectively. All mutagenesis was carried out using Q5 mutagenesis kit 

108 (NEB, MA, USA). After the entire coding sequences were verified by sequencing, the BamH I 

109 and Not I containing mutated S gene was cloned back into pcDNA3.1. A plasmid encoding full- 

110 length hACE2 (pACE2-cq) was kindly provided by M. Farzan (Scripps Research Institute, 

111 Florida campus). A plasmids encoding full-length human DPP4 (pcDNA-hDPP4) was purchased 

112 from Sino Biological Inc (Beijing, China). A plasmid encoding full-length mouse CEACAMla 

113 (mCEACAMla) has been described previously (39). To express soluble human ACE2 (shACE2) 

114 and soluble human DPP4 (shDPP4), DNA fragments encoding residues 19-615 of human 

115 hACE2 with N-terminal 6his and FLAG tags and residues 40-766 of human DPP4 with C- 

116 terminal 6his and AVI tags were cloned between Sal I and Hind III and between BamH I and 

117 Xho I of modified pFASTBacl vector with gp67 signal peptide, respectively. To express soluble 

118 mouse CEACAMla (smCEACAMla), residues 1-236 of mCEACAMla with C-terminal 6his 


D 

o 


cfWology JVI Journal of Virology Accepted MdnUSCfTpf Posted OfTlme 


119 and AVI tags were cloned into EcoR I and Not I of pFASTBacl. These soluble receptors were 

120 expressed in High Five insect cells using the bac-to-bac system (Invitrogen) and purified using 

121 nickel affinity and ion-exchange chromatography. 

122 Analysis of S protein expression on cell surface. Briefly, HEK-293T cells were transfected 

123 with 2 pg of either wild-type or mutant S protein-expressing plasmid using polyethyleneimine 

124 (PEI) (Polyscience Inc, Warrington, PA, USA). Forty hours later, cells were detached from 

125 plates by incubating with PBS+lmM EDTA for 5min at 37°C. After washing, cells were 

126 incubated with the respective primary anti-S antibody for 1 hour on ice. The primary antibodies 

127 for SARS-CoV SA19, MERS-CoV SA16, and MHV S protein were rabbit polyclonal anti-SARS 

128 SI antibody (1:300 dilution) (Sinobiological Inc, Beijing, China), mouse monoclonal anti-MERS 

129 S antibody (1:300 dilution) (Sinobiological Inc, Beijing, China), and goat polyclonal anti-MHV 

130 S antibody (A04) (1:200 dilution), respectively. After washing, cells were stained with Alexa 

131 Fluor 488 conjugated goat anti-rabbit IgG (1:200) (ZSGB-Bio LLC, Beijing, China) for SARS S, 

132 or goat anti-mouse IgG (1:200) (ZSGB-Bio LLC, Beijing, China) for MERS S, or rabbit anti- 

133 goat IgG (1:200) (ZSGB-Bio LLC, Beijing, China) for MHV S. After washing, cells were fixed 

134 with 1% paraformaldehyde and analyzed by flow cytometry. 

135 Binding of soluble receptor. HEK-293T cells were transfected with plasmids encoding either 

136 wild-type or mutant S proteins with PEL After 40 hours, cells were lifted with PBS+lmM EDTA 

137 and immediately washed twice with PBS+2% normal donkey serum (NDS). About 2xl0 5 cells 

138 were incubated with 1 pg of shACE2, or shDPP4, or smCEACAMla for 1 hour on ice. After 

139 washing, cells were incubated with mouse monoclonal anti-FLAG M2 antibody (1:1,000 dilution) 

140 (Sigma, St Louis, MO, USA) for shACE2 and followed with Alexa Fluor 488 conjugated goat 

141 anti-mouse IgG (1:200), or rabbit polyclonal anti-AVI antibody (1:200 dilution) (Shanghai 
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142 Enzyme-linked Biotechnology Co., Shanghai, China) for shDPP4 and smCEACAMla, and 

143 followed with Alexa Fluor 488 conjugated goat anti-rabbit IgG (1:200). Cells were fixed with 

144 1% paraformaldehyde and analyzed by flow cytometry. 

145 Production and transduction of S protein-pseudotyped lentiviruses. Pseudovirions with 

146 spike proteins were produced as described previously (40) with minor modifications. Briefly, 

147 plasmids encoding either wild-type or mutant S proteins were co-transfected into 293T cells with 

148 pLenti-Luc-GFP (a gift from Dr. Fang Li, Duke University) and psPAX2 (Addgene, Cambridge, 

149 MA) at a molar ratio of 1:1:1 by using PEL The following day, the cells were fed with fresh 

150 medium. After 24 hrs incubation, the supernatant media containing pseudovirions were 

151 centrifuged at 800g for 5min to remove debris, and passed through a 0.45-um filter. To quantify 

152 S protein-mediated entry of pseudovirions, susceptible cells were seeded at about 25-30% 

153 confluency in 24-well plates. The following day, cells were inoculated with 500ul of 1:1 diluted 

154 viruses. At 40 hours post-inoculation (PI), cells were lysed at room temperature with 120pl of 

155 media with an equal volume of Steady-glo (Promega, Madison, WI). Transduction efficiency 

156 was monitored by quantitation of luciferase activity using Modulus II Microplate Reader (Turner 

157 Biosystem, Sunnyvale, CA). All experiments were done in triplicate and repeated at least three 

158 times. 

159 Detection of viral spike glycoproteins by western blot. To evaluate S protein expression in 

160 cells, HEK 293T cells were transfected with plasmids encoding either wild-type or mutant S 

161 proteins by using PEL Forty hours later, cells were lysed with lysis buffer (50 mM Tris-HCl 

162 pH7.4, 150 mM NaCL 2 mM EDTA, 1% Triton X-100, 0.1% SDS) containing protease 

163 inhibitors (Roche, USA). To determine S protein incorporation into pseudotype virions, the 

164 virion-containing supernatant was pelleted through a 20% sucrose cushion at 30,000 rpm at 4°C 
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165 for 2 h in a Beckman SW41 rotor (40). Viral pellets were resuspended into PBS. Cell lysates and 

166 pseudovirion pellets were separated on a 4-15% SDS-PAGE and transferred to a nitrocellulose 

167 blot. The SARS-CoV SA19, MERS-CoV SA16, and MHV S proteins were detected with 

168 polyclonal rabbit anti-SARS SI antibodies (1:2,000), monoclonal mouse anti-MERS S antibody 

169 (1:1,000), and polyclonal goat anti-MHV S antibody (1:2,000), respectively, and the blots were 

170 further stained with horseradish peroxidase conjugated antibodies, respectively: goat anti-rabbit 

171 IgG (1:10,000), goat anti-mouse IgG (1:10,000), and rabbit anti-goat IgG (1:10,000), and 

172 visualized with Clarity Western ECL substrate (Bio-Rad, Hercules, CA, USA). The (3-actin and 

173 HIV capsid protein (p24) were detected using mouse monoclonal anti-|3-actin antibody (1:5,000) 

174 (Sigma, St Louis, MO, USA) and rabbit polyclonal anti-p24 antibody (1:5,000) (Sinobiological 

175 Inc, Beijing, China), respectively. 

176 Cell-cell fusion assays. Cell-cell fusion assays were performed as previously described (37) with 

177 modifications. Briefly, 293T cells were co-transfected with plasmids encoding CoV S 

178 glycoprotein and GFP. Forty hours later, cells were detached with trypsin (0.25%) and overlaid 

179 on a 70% confluent monolayer of 293/hACE2, or HeLa/hDPP4, or HeLa/mCEACAMla cells at 

180 a ratio of approximate one S-expressing cell to two receptor-expressing cells. After overnight 

181 incubation, images of syncytia were captured with a Nikon TE2000 epifluorescence microscope 

182 running MetaMorph software (Molecular Devices). To quantify S protein mediated cell-cell 

183 fusion, 293T cells were co-transfected with pFR-Luc, which contains a synthetic promoter with 

184 five tandem repeats of the yeast GAL4 binding sites that controls expression of the luciferase 

185 gene, and plasmid encoding S protein, and the receptor-expressing cells (293/hACE2, 

186 HeLa/hDPP4, or HeLa/mCEACAMla) were transfected with pBD-NFKB, which encodes a 

187 fusion protein with DNA binding domain of GAL4 and transcription activation domain of NFkB. 
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188 The following day, S expressing 293T cells were lifted with trypsin and overlaid onto receptor 

189 expressing cells at a ratio of about one S-expressing cell to two receptor-expressing cells. When 

190 cell-cell fusion occurred, luciferase expression would be activated through binding of the GALT- 

191 NFkB fusion protein to GAL4 binding sites at the promoter of the luciferase gene. After 24 hrs 

192 incubation, cells were lysed by adding 120pl of medium with an equal volume of Steady-glo, and 

193 luciferase activity was measured with a Modulus II Microplate Reader. All experiments were 

194 done in triplicate and repeated at least three times. 

195 Peptide synthesis. All peptides were synthesized using a standard solid-phase FMOC (9- 

196 fluorenylmethoxy carbonyl) method by Scilight Biotechnology LLC (Shanghai, China). 

197 Purification was carried out by reversed-phase high-performance liquid chromatography (FIPLC), 

198 and verified by mass spectrometry. An Ahx-KKK linker was added to all peptides used in 

199 circular dichroism (CD) spectroscopy analysis to increase peptide solubility in PBS. Peptides for 

200 CD analysis include: CTRL: KWGQYTNSPFLTKGF-Ahx-KKX, a control peptide from a 

201 previous SARS study (33); HIV FP (41): AVGIGALFLGFLGAAG-Ahx-KKK; and MERS pFP: 

202 S SLLG SI AG VG WT AGL S SF AAI-Ahx-KKK. Peptides for lipid mixing study include: CTRL: 

203 KWGQYTN SPFLTKGF; HIV FP: AVGIGALFLGFLGAAG; MERS short FP (sFP): 

204 IAGVGWTAGL; MERS mutant FP (mFP): IAGRGRTAGL. 

205 CD spectroscopy. CD spectroscopy analysis was performed to study the secondary structure of 

206 fusion peptides in increasing trifluoroethanol (TFE) concentrations. CD spectra were acquired on 

207 a Jasco J-815 spectropolarimeter (Jasco, Tokyo, Japan) using a 1-nm bandwidth with a 1-nm step 

208 resolution from 195 to 260 nm at room temperature. Spectra were corrected by subtraction of its 

209 respective solvent. The sample spectrum was smoothed with a Savitsky-Golay filter. The a- 

210 helical content was estimated from the ellipticity value at 222nm, [ 6 ( 222 , according to the 
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211 empirical equation of Chen et al (42): %helical content=100* ([0]222/-395OOOx(l-2.57/n)) , 

212 where n is the number of peptide bonds. 

213 Preparation of liposomes. Equimolar amounts of egg phosphatidylethanolamine (PE), egg 

214 phosphatidylcholine (PC), and cholesterol (Avanti Polar Lipids, Alabaster, Ala., USA) were 

215 dried from chloroform into a thin film by constant flow of nitrogen gas, and rehydrated in Tris 

216 buffer (10 mM Tris, 150 mM NaCl, 0.1 mM EDTA, pH7.2) at a concentration of 10 mM. Large 

217 unilamellar vesicles (LUV) were prepared by the extrusion procedure (43). Briefly, after ten 

218 freeze-thaw cycles, liposomes were extruded 21 times through two stacked polycarbonate 

219 membranes with a pore size of 0.1 pm using an Avanti mini-extruder. Liposome with 0.6% 

220 (molar ratio) fluorescent resonance energy transfer (FRET) pairs Rho-PE and NBD-PE (Thermo 

221 Fisher) were prepared in the same way. 

222 Lipid mixing. Lipid mixing was determined using the resonance energy transfer assay, described 

223 by Struck et al (44) with minor modifications. Briefly, Rho-PE and NBD-PE labeled liposomes 

224 were mixed with unlabeled liposomes at a ratio of 1:9. The final lipid concentration was 300 pM. 

225 Specified amounts of various peptides were added to initiate fusion, and changes in fluorescence 

226 were monitored at 535 nm with the excitation wavelength set at 465 nm and a slit width of 4 nm 

227 using Fluromax-4 (Floriba, Paris, France). The initial residual fluorescence of the labeled and 

228 unlabeled vesicles was set up as baseline for 0% fluorescence value (fo); 100% fluorescence 

229 value (fioo) was achieved by addition of Triton X-100 to final concentration of 0.2%. The extent 

230 of lipid mixing was calculated using the following formula: %F t =(f t -fo)/(fioo-fo)*100, where f ( is 

231 the fluorescence value observed after addition of fusion peptide at time t. 

232 Results 
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233 During membrane fusion, the FP of S proteins inserts into the host membranes. We 

234 reasoned that CoV FPs might share some common properties with the transmembrane domains 

235 (TMD) and that the location of the FP within the S protein might be predictable by using TMD 

236 prediction software programs. The FPs of FIIV-1 Env and influenza FIA have been studied 

237 extensively and their locations and amino acid sequences are known. As a proof of concept, we 

238 first tested whether TM software programs could accurately identify the FPs of FIIV-1 Env and 

239 influenza H1N1HA proteins. Both the FPs and TMDs of HIV-1 Env and influenza HA were 

240 accurately identified by two software programs, TMpred 

241 ( http://www.ch.embnet.org/software/TMPRED form.html ) and TMHMM 

242 (http://www.cbs.dtu.dk/services/TMHMM/) (Data not shown). Subsequently, we applied these 

243 two software programs to analyze S proteins of a wide variety of CoVs. The positions of the 

244 TMDs of the S proteins of all Co Vs studied were correctly identified by both software programs 

245 (Fig IB). In addition, both of these TMD prediction programs identified another region 

246 consistently flanked by YT at the N-terminus and PF at the C-terminus in all of the S proteins of 

247 the CoVs tested (Figs. IB and 1C). Although the primary amino acid sequences of this region 

248 were not conserved in all of the CoVs studied, they were all Ala or Gly rich, relatively 

249 hydrophobic, and contained no charged residues, characteristics shared by the FPs of other 

250 Class-I viral fusion proteins (Fig. 1C). We named this region in CoV S proteins the possible FP 

251 (pFP). 

252 To investigate if the pFP is the functional fusion peptide of CoVs, we selected the S 

253 protein of MERS-CoV, a lineage C P-CoV, as an example. The MERS-CoV pFP contains amino 

254 acids 949 to 970 (Fig. 1C). Individual and occasionally double amino acid substitutions were 

255 introduced at each position ofpFP (Fig. ID). First, we determined if any of the mutations altered 
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256 the expression of S protein in 293T cells. Consistent to our previous report (37), two bands 

257 around 200 kDa were detected in the cell lysate of 293T cells expressing wild-type (WT) S 

258 protein, likely reflecting the different glycosylation of full length S proteins during transport 

259 through the Golgi apparatus. However, the cell lysate also contained a significant proportion of S 

260 protein cleaved between SI and S2, around 100 kDa, which was absent in our previous report, 

261 but previously reported by the Pohlmann laboratory (45). The difference between this study and 

262 our early report likely resulted from different culture conditions, especially sera and media from 

263 different vendors. Among the total 44 G, A, V, or R substitutions, 30 (S949G, S950G, L952A, 

264 G953A, G953R, S954G, S954R, I955G, A956V, A956R, G957A, G957R, V958G, V958R, 

265 I955G/V958G, G959A, G959R, W960G, W960R, V958G/W960G, T961A, A962V, A962R, 

266 G963A, G963R, L964G, L964R, S965G, S966G, and A968V) showed no or minor effects on S 

267 protein expression or processing when compared to WT(Fig. 2A and Table 1). On the contrast, 

268 14 substitutions (L951G, L952G, L951G/L952G, S965R, S966R, F967G, L964F/F967G, A968R, 

269 A969V, A969R, I970G, P971V, F972G, and I970G/F972G) showed significant reductions in S 

270 protein expression and changes in patterns of S protein processing (Fig. 2A and Table 1). The 

271 cleaved S protein species were almost absent in corresponding cell lysates, suggesting that these 

272 residues (L951, L952, S965, S966, F967, A968, A969,1970, P971, and F972) maybe important 

273 for S protein folding and processing. 

274 We then investigated if any amino acid substitutions in the pFP influenced transport of 

275 the S protein to the cell surface. The 293T cells expressing WT or mutant S proteins were 

276 incubated on ice with mouse monoclonal anti-MERS-CoV S protein antibody and analyzed by 

277 flow cytometry. The same 30 mutants that showed WT levels of S protein expression in cell 

278 lysates also showed WT levels of S protein on the cell surface (Fig. 2B and Table 1). As 
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279 expected, the mutants with defects in S protein expression and processing also showed only low 

280 levels of S proteins on the cell surface. 

281 Although the pFP is located within the MERS-CoV S2 subunit, amino acid substitutions 

282 in pFP might affect S protein binding to its cognate receptor, hDPP4, by altering the overall 

283 conformation of the S protein. To determine whether or not any amino acid substitution in pFP 

284 changed S protein binding to hDPP4, we used V5-tagged soluble hDPP4 (shDPP4) to bind 293T 

285 cells transiently expressing WT or pFP mutant S proteins of MERS-CoV. The percentage of cells 

286 that bound shDPP4 and the level of shDPP4 bound to S protein were quantitated by flow 

287 cytometry. The same 30 mutant S proteins that showed WT levels of expression on cell surface 

288 also bound to shDPP4 at levels similar to WT S protein (Fig. 3 and Table 1), indicating that these 

289 pFP mutations had no effect on receptor binding. 

290 Because the fusion peptide is essential for S protein-mediated membrane fusion, we then 

291 explored whether any mutation in pFP altered MERS-CoV S protein-mediated cell-cell fusion. 

292 To more easily visualize cell-cell fusion or syncytia, the 293T cells expressing S protein were co- 

293 transfected with a GFP-expressing plasmid, then overlaid with FIeLa/hDPP4 cells in the presence 

294 of trypsin. Consistent with our previous report (37), WT MERS-CoV S protein induced very 

295 large syncytia (Fig 4) and syncytia formation depended on the availability of hDPP4 (data not 

296 shown). Among 30 pFP S protein mutants that were expressed well, transported to the cell 

297 surface efficiently, and bound to hDPP4 at levels similar to WT, 14 mutants (S949G, S950G, 

298 G953A, S954G, A956V, A956R, G957A, V958G, G959A, A962V, G963A, L964G, S965G, and 

299 A968V) induced large syncytia in FIeLa/hDPP4 cells similar to WT, while 12 mutants (G953R, 

300 S954R, I955G, G957R, V958R, I955G/V958G, W960R, V958G/W960G, T961G, A962R, 

301 G963R, and L964R) induced little or no syncytia formation, and 4 mutants (L952A, G959R, 
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302 W960G, and S966G) induced syncytia of much smaller size than WT (Fig 4). These results 

303 indicate that these 13 residues, L952, G953, S954,1955, G957, V958, G959, W960, T961, A962, 

304 G963, L964, and S966, in MERS-CoV S protein are critical for S protein-mediated, receptor- 

305 dependent membrane fusion that would lead to virus infection. 

306 To quantify the effect of amino acid substitutions on S protein-mediated syncytia 

307 formation, we utilized a luciferase-based quantification assay from a yeast two hybrid system 

308 from Stratagene-Agilent Technologies, Inc. Compared to mock transfection and parental HeLa 

309 cell controls, fusion of 293T cells expressing WT MERS-CoV S proteins with HeLa/hDPP4 cells 

310 increased luciferase activity by about 1,000-fold (Fig.5). The overall pattern of cell-cell fusion 

311 induced by pFP mutants in this quantification assay was very similar to our visual method (Fig. 4 

312 and 5, Table 1). Among the same 30 mutants showing WT level of expression and receptor 

313 binding, 16 mutants (S949G, S950G, L952A, G953A, S954G, A956V, A956R, G957A, V958G, 

314 G959A, A962V, G963A, L964G, S965G, S966G, and A968V) retained 50-110% of WT level 

315 fusion activity, but 14 mutants (G953R, S964R, I955G, G957R, V958R, I955G/V958G, G959R, 

316 W960G, W960R, V958G/W960G, T961G, A962R, G963R, and L964R) reduced S protein- 

317 mediated cell-cell fusion by more than 85%, indicating that these residues (G953, S954,1955, 

318 G957, V958, G959, W960, T961, A962, G963, and L964) are essential for membrane fusion. 

319 To determine whether or not any mutation in the pFP of the S protein of MERS-CoV also 

320 affected virus entry, we measured transduction of HeLa/hDPP4 cells by lentiviral pseudovirions 

321 with envelopes containing either WT or pFP mutant MERS-CoV S proteins. Compared to mock 

322 control (pseudovirions without any S protein), the luciferase activity in HeLa/hDPP4 cells 

323 increased by more than 10,000 fold following transduction by pseudovirions with WT MERS- 

324 CoV S proteins (Fig. 6A). Among the same 30 mutants that showed little or no effects on S 
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325 protein expression or receptor binding (Figs 2A, 2B, 3, and Table 1), 5 mutants (L952A, G953A, 

326 G953R, G963R, and S966G) showed marked reduction in S protein incorporation into 

327 pseudovirions, whereas the S proteins of the other 25 mutants were incorporated into 

328 pseudovirions as well as WT S protein (Fig. 6B). Ten out of these 25 amino acid substitutions, 

329 S954R, I955G, G957R, V958R, I955G/V958G, W960R, V958G/W960G, T961G, A962R, or 

330 L964R, almost abolished MERS-CoV S protein-mediated, receptor-dependent pseudovirion 

331 entry (Fig 6A and Table 1), suggesting that S954,1955, G957, V958, W960, T961, A962, and 

332 L964 are essential for virus entry. In addition, G959R mutation also reduced the transduction by 

333 more than 95%, indicating that G959 may also be critical for virus entry too (Fig. 6A). 

334 Interestingly, although G953A, G953R, and G963R mutants showed reduced but similar levels 

335 of S protein incorporation into pseudovirions (Fig. 6B), the infectivity of the pseudovirions 

336 differed drastically. While G953A result in only 30% of WT level of pseudovirion entry, the 

337 G953R and G963R mutations almost abrogated S protein mediated pseudovirion entry, 

338 indicating that G953 and G963 may also be important for virus entry. 

339 Because the FPs of most Class-I viral fusion proteins fold predominantly in an a-helix 

340 structure in the presence of TFE (13), we used circular dichroism spectroscopy (CD) analysis to 

341 investigate whether our MERS-CoV pFP also adapts an a-helical fold. A scrambled peptide 

342 from a previous SARS-CoV study (33) was chosen as the negative control, and the FP of HIV-1 

343 was selected as the positive control (46). To facilitate the synthesis of the peptides and increase 

344 their solubility, an aminocaproic acid (Ahx) linker followed by 3 Lys residues (Ahx-KKX) was 

345 added to the C-termini of the peptides. Consistent with the previous reports (46), while the FP of 

346 HIV-1 folded as a random coil in Tris/salt buffer, it formed an a-helix in the presence of 

347 trifluoroethanol (TFE) (Fig. 7A), a solvent known to stabilize the a-helical formation (47). 
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348 Similarly, in the absence of TFE, the pFP of MERS-CoV (SSLLGSIAGVGWTAGLSSFAAI) 

349 folded as a random coil, but with the addition of TFE, it folded as an a-helix. At 95% of TFE, 

350 helical population accounted for more than 64% (Fig. 7A). 

351 FPs of Class-I viral fusion proteins also promote membrane fusion when mixed with 

352 liposomes. Accordingly, we investigated whether the pFP of MERS-CoV S protein could 

353 mediate liposome fusion using a FRET-based assay. To rule out any possible effect of the Ahx- 

354 KKK tag, we decided to use peptides without any tag. Flowever, because of the technical 

355 difficulty of synthesizing the hill length pFP without the AFIX-KKK tag, we decided to use 

356 instead the core sequence of pFP ( 955 IAGVGWTAGL 964 , called “short pFP” or sFP) in this study, 

357 in which almost all of the residues were shown to be essential for cell-cell fusion and virus entry. 

358 As shown in Fig 7B, both the FP of FIIV-1 and the sFP of MERS-CoV induced membrane fusion 

359 of liposome in a concentration dependent manner, whereas the negative control peptide did not 

360 induce any significant lipid mixing. Moreover, when we replaced V958 and W960, two residues 

361 essential for cell-cell fusion and virus entry, with Arg in the MERS-CoV sFP peptide, the 

362 resulting mutant FP (mFP) (955IAGRGRTAGL964) failed to induce any noticeable lipid mixing, 

363 confirming that these two residues are essential for lipid mixing. 

364 Flaving established the essential roles in membrane fusion and virus entry of the pFP of 

365 the S protein MERS-CoV, a P-CoV in group C, we also investigated the functional role of the 

366 pFPs of other CoVs. After examining the alignment of the pFPs of different CoVs (Fig. IB), we 

367 selected the pFPs of the S proteins of SARS-CoV, a lineage B P-CoV, and MF1V, a lineage A P- 

368 CoV, for functional study. While the pFP of SARS-CoV shares the same length and has about 

369 1/3 of amino acid sequence identity with the pFP of MERS-CoV, the pFP of MF1V differs 

370 markedly from that of MERS-CoV in both length and amino acid sequence. Since hydrophobic 
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371 residues in the pFP of MERS-CoV play important roles in membrane fusion, we selected W868, 

372 F870, L876 and 1878 of SARS-CoV S protein and M936, F937, P938, P939, and W940 of MHV 

373 S protein for further analysis. Single Arg and/or Gly substitutions were introduced into the MHV 

374 and SARS-CoV S proteins at these positions. 

375 With the exception of 1878-related mutants, the pFP mutant S proteins of SARS-CoV 

376 were expressed well (data not shown), bound well to its receptor, hACE2, at levels similar to WT 

377 (data not shown), and were incorporated into pseudovirions efficiently (Fig 8B). 1878 mutants 

378 (I878G, I878R, and double mutant L876G/I878G) were expressed slightly less well in cell 

379 lysates (data not shown) and showed reduced S protein incorporation into pseudovirons (Fig. 8B), 

380 indicating that 1878 may play a role in folding and transport of S protein. Similar to MERS-CoV 

381 S protein, all Arg mutations in pFP of SARS-CoV effectively abolished S protein mediated cell- 

382 cell fusion and virus entry (Fig. 8A, 8C, and Table 1), suggesting that these residues are indeed 

383 essential for membrane fusion. Compared to Arg mutations, Gly substitutions in the pFP of 

384 SARS S protein had less effect on cell-cell fusion and virus entry. Interestingly, although the 

385 single mutants, W868G and F870G, showed almost WT level infection, the double mutant 

386 W868G/F870G abolished S protein mediated virus entry (Fig. 8A), confirming that these two 

387 residues in S protein of SARS-CoV are important for membrane fusion. 

388 All MHV S protein pFP with single Arg substitutions (M936R, F937R, P938R, P939R, 

389 and W940R) showed significant reduction in both S-mediated pseudovirion entry (Fig. 8D and 

390 Table 1) and cell-cell fusion (Fig. 8F and Table 1). S proteins with M936R substitutions, 

391 however, showed significantly decreased expression of S protein in cell lysate (data not shown) 

392 and incorporation into pseudovirions (Fig. 8E). This may partly explain why M936R mutations 

393 had detrimental effects on vims infection and cell-cell fusion. P938R substitution also showed 
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394 slight reduction in expression and virion incorporation of S protein. In contrast, S proteins with 

395 F937R, P939R, and W940R substitutions had wild-type levels of S protein expression (data not 

396 shown) and incorporation into virions (Fig. 8E), and binding to its cognate receptor (data not 

397 shown), mCEACAMla, but failed to mediate virus entry or syncytia formation. These data 

398 indicate that F937, P939, and W940 in the pFP may be essential for MHV S protein-mediated 

399 membrane fusion. 

400 Discussion. 

401 Proteolytic priming is one of the early essential steps required to activate the fusion 

402 potential of Class-I viral fusion proteins, and is believed to release the restrain on the viral FP 

403 leading to exposure of the FP. The proteolytic priming sites for most of the Class-I viral fusion 

404 proteins are either immediately proximal to or not far upstream of the viral FP (21-26). Therefore, 

405 identifying the key proteolytic priming site may lead to discovery of a viral FP. However, in the 

406 case of Co Vs, the priming sites are less clear. In an attempt to identify the trypsin cleavage site 

407 essential for MERS-CoV S protein mediated trypsin-dependent entry, we mutated several trypsin 

408 sites (R884G/R887G, K897G, R921G, and K933G) upstream of the N-terminus of HR-N of 

409 MERS-CoV S protein (48, 49). Surprisingly, we found that none of these sites was essential for 

410 trypsin-primed MERS-CoV S protein-mediated virus entry (Data not shown). Therefore, there 

411 might be built-in redundancy of trypsin priming sites within the MERS-CoV S protein such that 

412 cleavage by trypsin might occur at multiple sites and single cleavage at any one of these sites 

413 might be sufficient to prime the MERS-CoV S protein. 

414 Since there was not a single essential trypsin priming site for the S protein of MERS-CoV, 

415 we used an alternative approach to look for the FP of MERS-CoV S protein. Using TMpred and 

416 TMHMM software programs to analyze the S2 domains of a variety of CoVs, we identified a 
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417 region in S2 that is flanked by YT at the N-terminus and PF at the C-terminus and found in all 

418 the CoVs studied (Figs IB and 1C). This pFP region has characteristics of the known FPs of 

419 other Class-I viral fusion proteins, Gly or Ala rich, relatively hydrophobic, and without charged 

420 residues. This pFP region is located at about 7-23 amino acids upstream of the N-terminus of 

421 FIR-N of CoV S proteins, depending on where the N-terminus of FIR-N was proposed (48-53). 

422 Mutagenesis analysis on the pFPs of MERS-CoV, SARS-CoV, and MF1V S proteins revealed 

423 that this region was essential for S protein mediated syncytia formation and virus entry (Table 1), 

424 and strongly support the idea that the pFP of (3-CoV S protein is the functional viral fusion 

425 peptide. This conclusion is further strengthened by our findings that the synthetic pFP of MERS- 

426 CoV S protein formed an a-helix in the presence of TFE and its core short sequence, called sFP, 

427 mediated membrane fusion of liposome efficiently (Fig 7), which are characteristics of FPs of 

428 other Class-I viral fusion proteins (13). Our results are also consistent with previous biophysical 

429 studies on synthetic peptides from SARS-CoV S protein (29, 33) and previous studies in MF1V 

430 showing that P939 may be critical for membrane fusion and virus infection (54, 55). 

431 About one third of the residues located at the C-terminus of the pFP of MERS-CoV S 

432 protein appear to play important roles in the stability and processing of the S protein, since 

433 introduction of amino acid substitutions into these positions significantly reduced S protein 

434 expression, processing and incorporation into pseudotyped virions. Residues close to the C- 

435 terminus of the pFP of the SARS-CoV S protein also appear to be important for S protein folding, 

436 as replacement of 1878 with R or G also decreased S protein expression and incorporation into 

437 virions. Flowever, this region might also be important for membrane fusion mediated by S 

438 protein. A recent study on SARS-CoV by Liao et al (56) raised the possibility that this region 

439 might make direct interactions with the JMD in S protein during membrane fusion. 
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440 Among the amino acid substitutions that we introduced into the pFP of MERS-CoV S 

441 protein, Arg had a more profound effect on the function of the pFP of MERS-CoV S protein than 

442 Gly, Ala, or Val. Compared to Gly, Ala, and Val, Arg is positively charged and its side chain is 

443 significant longer than Gly, Ala, or Val, hence Arg substitution represents a more dramatic 

444 change than these amino acids. Moreover, Arg substitution may cause a higher free energy 

445 barrier for insertion of FP into membrane (57). Although the exact mechanism(s) of how these 

446 substitutions in the pFP abrogate membrane fusion requires further investigation, there are 

447 several possibilities. Introduction of mutation(s) into the pFP of MERS-CoV S protein might 

448 distort the structure of FP required for membrane fusion similar to G1V and W14A mutations of 

449 the FP of influenza FIA (58-60). Alternatively, the substitutions might change how the FP inserts 

450 into membranes (61-63), or affect the oligomerization of the FPs that is important for membrane 

451 fusion (64, 65). 

452 Recent studies in influenza FIA (66), paramyxovirus F protein (67), and FIIV Env (68) 

453 reveal that many viral FPs interact and oligomerize with their TMDs in the lipid, which promotes 

454 lipid mixing and membrane fusion. Whether the FP and TMD of CoV S protein interact with 

455 each other during membrane fusion remains to be further determined. Interestingly, the primary 

456 amino acid sequences of the TMDs among different CoVs also do not share high identity (Fig 9). 

457 Of note, there is a GXXXG or (small)XXX(small) motif (G, Gly; small, Ala or Gly or Ser; X, 

458 any residue) present in all of the pFPs of CoVs. These motifs were initially discovered in human 

459 glycophorin A and have subsequently been implicated in TMD interactions of more than 20 

460 proteins (69). Recent studies in influenza HA and HIV Env have suggested that such GXXXG 

461 motifs may also play an important role in FP:FP or FP:TMD interaction (66, 68, 70). There are 

462 two GXXXG motifs, GSIAG and GWTAG, within the FP of MERS-CoV. Replacement in 
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463 MERS-CoV S protein of any one of these four Gly residues (G953, G957, G959, or G963) with 

464 Arg abrogated the membrane fusion activity of the viral protein. However, whether these 

465 GXXXG motifs in the pPF of MERS S protein are essential for oligomerization or interaction 

466 with the TMD requires further investigation. 

467 FPs of some Class-I viral fusion proteins, like HIV Env and influenza HA, share high 

468 identity in primary amino acid sequence within each virus family. In marked contrast, this study 

469 found no strong amino acid sequence identity among the pFPs of MERS-CoV, S ARS-CoV, and 

470 MHV. The lengths of the FPs of these three different lineages of (3-Co Vs also differ significantly, 

471 ranging from 18 for MHV to 22 amino acids for MERS-CoV and SARS-CoV (Fig. IB). Within 

472 each lineage of (3-CoVs, the pFPs appear to be better conserved (Fig IB). Although underline 

473 mechanism(s) causing the amino acid sequence diverge of FPs of different lineages of (3-CoVs 

474 remains to be determined, CoV RNA-dependent RNA polymerase error, recombination, and 

475 selective pressure during evolution likely contribute to these changes. Previous study of MHV 

476 persistent infection in DBT cells showed that accumulation of mutations in fusion peptide and 

477 HR-N could lead to extending host range (55). The lack of conservation of the pFP amino acid 

478 sequences, however, is not unique for CoVs, as FPs from different paramyxoviruses also lack 

479 high identity in their primary amino acid sequences (67). 

480 As an internal fusion peptide, how does the activated FP of CoVs fold and mediate 

481 membrane fusion? Recent studies have demonstrated that FPs from different Class-I viral fusion 

482 proteins might adapt different conformations to mediate membrane fusion. Depending on the 

483 lipid composition, the FPs of HIV-1 Envs and PIV F proteins can fold as either a-helix (67, 71) 

484 or (3-sheet (65, 72), and both can be fusiogenic. In contrast, the overall conformation of the FPs 

485 of Ebola Gp and influenza HA is a-helical in the presence of TFE, but they fold as hairpin-like 
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486 structure or “knuckle” conformations when they insert into their target membranes (63, 73). 

487 Sequence analysis of the S proteins of different CoVs (Fig IB) shows the presence in the pFPs of 

488 a Gly-Gly (GG) motif in a-, y-, and 5-CoVs or a Pro-Pro (PP) motif in P-CoVs in lineage A. As 

489 GG and PP motifs favor the formation of turn or hairpin structures, this observation suggests that 

490 the FPs of some CoVs might also adapt a hairpin-like structure when inserting into host 

491 membranes. In the FPs of SARS-CoV and MERS-CoV in P-CoV groups b and c respectively, 

492 however, neither a GG nor a PP motif is present. Of note, the FP from group 2 influenza HA also 

493 lacks a central GG or PP motif, but instead it forms a hairpin-like structure with G13 at the turn 

494 with a Trp and a hydrophilic residue immediately following G13 (74). Interestingly, a similar 

495 motif is also present in the pFPs of SARS-CoV and MERS-CoV (Fig IB). 

496 While all known Class-Ill viral fusion protein have two fusion loops, all known Class-I 

497 viral fusion proteins except for CoV S protein only have a single fusion peptide. In the case of 

498 CoVs, in addition to pFP found in this study, Manu et al previously found a highly conserved 

499 region in SARS-CoV S protein essential for membrane fusion and proposed it as the possible 

500 fusion peptide (32), although this sequence lacks some common features of FPs of other Class-I 

501 viral fusion proteins, including high Ala/Gly content. Their proposed FP is about 80 amino acids 

502 away from the N-terminus of HR-N (50, 52) and about 40 amino acids upstream of the N- 

503 terminus of our pFP. The possibility of presence of two possible fusion peptides in the S protein 

504 of CoV is very intriguing. How these two possible fusion peptides collaborate to mediate 

505 membrane fusion requires further investigation. 

506 In summary, using a bioinformatics approach we have identified a region in the S 

507 proteins of CoVs that has several properties the FPs of several classical Class-I viral fusion 

508 proteins. Further molecular biological, biochemical, and biophysical analyses demonstrated that 
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509 this region is essential for receptor-dependent membrane fusion mediated by S proteins of 

510 several |3-CoVs in different lineages, strongly suggesting that it is the functional FP of these and 

511 likely other CoVs. These findings will provide significant clues for future studies of the 

512 membrane fusion mechanism of CoVs and may provide a new target for drugs against CoV 

513 infections. 
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521 Figure legend. 

522 Figure 1. pFPs of CoVs. (A) Diagram of CoV spike protein. NTD, N-terminal domain; C- 

523 domain, C-terminal domain; Cleavage site, protease cleavage site separating SI and S2; pFP, 

524 possible fusion peptide; FIR-N, N-terminal heptad repeat; FIR-C, C-terminal heptad repeat; 

525 JMD, Juxtamembrane domain; TMD, transmembrane domain. (B) Locations of pFPs and TMDs 

526 of S proteins of representative CoVs predicted by TMPred. (C) Amino acid sequence alignment 

527 of the pFPs of different CoVs. (D) Summary of the amino acid substitutions made in the pFP of 

528 MERS-CoV S protein. 

529 Figure 2. Analysis of expression ofpFP mutants of MERS-CoV S protein in 293T cells. (A) 

530 Western blot analysis of expression of WT or mutant MERS S protein in cell lysate. The MERS 

531 S protein was detected by using mouse monoclonal anti-MERS S antibody; (3-actin was detected 
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532 with mouse monoclonal anti-actin antibody. (B) Analysis of surface expression of mutant 

533 MERS-CoV S protein by flow cytometry. MERS-CoV S protein expressing 293T cells were 

534 stained with mouse monoclonal anti-MERS S antibody. The amount of wild-type S protein on 

535 cell surface was set as 100%. All of the experiments shown were repeated at least three times. 

536 Figure 3. Receptor binding by mutant MERS S proteins. MERS-CoV S protein expressing 293T 

537 cells were incubated with soluble AVI-tagged hDPP4, followed with polyclonal rabbit anti-AVI 

538 antibody and FITC conjugated goat anti-rabbit IgG. The results from wild-type were set as 100%. 

539 Figure 4. Cell-cell fusion mediated by WT or mutant MERS-CoV S protein. MERS-CoV S 

540 protein expressing 293 T cells were transiently transfected with eGFP, then incubated with 

541 FIeLa/hDPP4 cells for overnight in the presence of trypsin. 

542 Figure 5. Quantitative analysis of syncytia formation mediated by WT or mutant MERS-CoV S 

543 protein. Cell-cell fusion was quantified by measurement of luciferase activities. Typically, the 

544 relative luciferase activities from cell-cell fusion induced by wild-type S protein were over 10 7 , 

545 while the reading for mock control was less than 1000. The experiments were done at least three 

546 times. 

547 Figure 6. Entry of pseudotype virions with wild-type or mutant MERS S protein. A. Entry of 

548 pseudovirions with wild-type or mutant MERS-CoV S proteins into FleLa/hDPP4 cells. 

549 Pseudovirus entry was quantitated by luciferase activity at 40 hrs post inoculation. A typical 

550 transduction by wild-type S protein pseudoviruses resulted in increase of over 10,000-fold of 

551 luciferase activity. The experiments were repeated at least three times and an average of three 

552 experiments is shown. B. Detection of wild-type or mutant S protein incorporation into 

553 pseudovirions by western blot analysis. MERS S protein was detected using mouse monoclonal 

554 anti-MERS S antibody; p24, a gag protein of FIIV, was detected using rabbit polyclonal anti-p24 
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555 antibodies. FL S: full length S protein. The experiments were repeated twice and a representative 

556 is shown. 

557 Figure 7. Biophysical analysis of synthetic pFP peptide of MERS-CoV. A. CD analysis of 

558 secondary structure of pFP of MERS-CoV S protein. CTRL: KWGQYTNSPFLTKGF- Ahx- 

559 KKX, a control peptide from previous SARS-CoV peptide study (33); FHV FP: 

560 AVGIGALFLGFLGAAG-Ahx-KKK; MERS pFP: SSLLGSIAGVGWTAGLSSFAAI-Ahx- 

561 KKX. All peptides were dissolved in PBS, and their CD spectrum was measured in the presence 

562 of indicated concentration of TFE. Experiments were done twice and one representative is shown. 

563 B. Lipid mixing induced by synthetic pFP of MERS-CoV S protein. LUVs were made with 

564 equal moles of PE, PC, and cholesterol. The extent of lipid mixing was determined by 

565 monitoring the changes in fluorescence intensity at 535 nm at 37°C upon addition of peptide. 

566 Each data point is averaged from three independent experiments, and error bars represent 

567 standard deviations of the means. CTRL: KWGQYTNSPFLTKGF; HIV FP: 

568 AVGIGALFLGFLGAAG; MERS sFP: IAGVGWTAGL; MERS mFP: IAGRGRTAGL. 

569 Figure 8. Effects of mutations at the pFPs of SARS-CoV and MHV on pseudovirus transduction 

570 and cell-cell fusion. A, D. Entry of wild-type or mutant SARS-CoV S protein pseudovirions into 

571 293/hACE2 cells (A) or MHV S protein pseudovirions into HeLa/mCEACAMla cells (D). 

572 Pseudovirus entry was quantitated by luciferase activity at 40 hrs post inoculation. The 

573 experiments were repeated at least three times and average of three experiments is shown. B, E. 

574 Detection of wild-type or mutant S protein of SARS-CoV (B) or MHV (E) incorporation into 

575 pseudovirions by western blot analysis. SARS S protein was detected using rabbit polyclonal 

576 anti-SARS SI antibody; MHV S protein was detected using goat polyclonal anti-MHV S 

577 antibody A04; p24, a gag protein of HIV, was detected using rabbit polyclonal anti-p24 
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578 antibodies. The experiments were repeated at least three times and one representative is shown. 

579 C, F. Cell-cell fusion mediated by mutant SARS (C) or MFIV (F) S proteins. Experiments were 

580 performed as in Fig. 3B, except that 293/hACE2 cells were used as targets for SARS-CoV S 

581 protein (C) and FleLa/mCEACAMla cells were used as targets for MFIV S protein (F). An 

582 average of three experiments is shown. 

583 Figure 9. Alignment of TMDs of S proteins of representative CoVs. 

584 
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Table I Summary of pFP mutants of betacoronaviruses 


Expression Expression on Incorporation Receptor Cell-cell Pseudovirion 
in cell lysate cell surface in viron binding fusion transduction 



S protein expression on cell surface and receptor binding: +++, >70% of WT; ++, 46-70% of WT; 
+, 20-45% of WT; -, <20% of WT. ND, not done. 

Cell-cell fusion and pseudovirion transduction: +++, >70% of WT; ++, 31-70% of WT; 

+. 5-30% of WT; -, <5% of WT. ND, not done. 
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